Provenance in ORCHESTRA

نویسندگان

  • Todd J. Green
  • Gregory Karvounarakis
  • Zachary G. Ives
  • Val Tannen
چکیده

Sharing structured data today requires agreeing on a standard schema, then mapping and cleaning all of the data to achieve a single queriable mediated instance. However, for settings in which structured data is collaboratively authored by a large community, such as in the sciences, there is seldom consensus about how the data should be represented, what is correct, and which sources are authoritative. Moreover, such data is dynamic: it is frequently updated, cleaned, and annotated. The ORCHESTRA collaborative data sharing system develops a new architecture and consistency model for such settings, based on the needs of data sharing in the life sciences. A key aspect of ORCHESTRA’s design is that the provenance of data is recorded at every step. In this paper we describe ORCHESTRA’s provenance model and architecture, emphasizing its integral use of provenance in enforcing trust policies and translating updates efficiently.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collaborative Data Sharing with Mappings and Provenance

COLLABORATIVE DATA SHARING WITH MAPPINGS AND PROVENANCE Todd J. Green Supervisors: Zachary G. Ives and Val Tannen A key challenge in science today involves integrating data from databases managed by different collaborating scientists. In this dissertation, we develop the foundations and applications of collaborative data sharing systems (CDSSs), which address this challenge. A CDSS allows colla...

متن کامل

Special Issue on Data Provenance: Applications and New Directions

Sharing structured data today requires agreeing on a standard schema, then mapping and cleaning all of the data to achieve a single queriable mediated instance. However, for settings in which structured data is collaboratively authored by a large community, such as in the sciences, there is seldom consensus about how the data should be represented, what is correct, and which sources are authori...

متن کامل

Update Exchange with Mappings and Provenance

We consider systems for data sharing among heterogeneous peers related by a network of schema mappings. Each peer has a locally controlled and edited database instance, but wants to ask queries over related data from other peers as well. To achieve this, every peer’s updates propagate along the mappings to the other peers. However, this update exchange is filtered by trust conditions — expressi...

متن کامل

Provenance and Data Synchronization

Replication increases the availability of data in mobile and distributed systems. For example, if we copy calendar data from a web service onto a mobile device, the calendar can be accessed even when the network cannot. In peer-based data sharing systems, maintaining a copy of the shared data on a local node enables query answering when remote peers are offline, guarantees privacy, and improves...

متن کامل

Use of feldspar grains in provenance determination and the study of transportation and depositional history, examples from central and NW Iran

Feldspar grains, as a significant provenance indicator, of two terrigenous formations from Central Iran, the Upper Red Formation, and Moghan area, Zivah Formation, are used for provenance determination and the study of transportation and depositional history. The Upper Red Formation (URF) is volumetrically the most important siliciclastic unit of the Central Iran and Zivah Formation (ZF) repres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Data Eng. Bull.

دوره 33  شماره 

صفحات  -

تاریخ انتشار 2010